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A.  Description  of  Scientific  Research  Goals  _  x 

'  / 

•Oar  long-term  research  goal  is  to  develop  and  impleme/ht  speaker-independent 
continuous  speech  recognition  systems.  -Wo  oeliovo  that  the  proper  utilization  of  speech- 
specific  knowledge  is  essential  for  such  advanced  systems.  9er  research  is  thus  directed 
toward  the  acquisition,  quantification,  and  representation  of  acoustic-phonetic  and  lexical 
knowledge,  and  the  application  of  this  knowledge  to  speech  recognition  algorithms.  In 
addition,  we  are  exploring  new  speech  recognition  alternatives  based  on  -AT  and 
connectionist  techniques.) 


Significant  Results  in  the  Last  Year: 

We  developed  a  statistical  mode!  for  predicting  the  acoustic  realization  of  stop 
consonants  in  various  positions  in  the  syllable  template.  A  unification-based 
grammatical  formalism  was  developed  for  incorporating  this  model  into  the  lexical 
access  algorithm.  We  provided  an  information-theoretic  justification  for  the 
hierarchical  structure  of  the  syllable  template.  \ 


•  We  analyzed  segmented  duration  for  vowels  and  fricatives  in  continuous  speech. 
Based  on  contextual  information,  we  developed  durational  models  for  vowels  and 
fricatives  that  account  for  over  70%  of  the  variance,  using  data  from  multiple, 
unknown  speakers.  ^ 

•  We  rigorously  evaluated  the  ability  of  human  spectrogram  readers  to  identify  stop 
consonants  spoken  by  many  talkers  and  in  a  variety  of  phonetic  contexts. 
Incorporating  the  declarative  knowledge  used  by  the  readers,  we  developed  a 
knowledge-based  system  for  stop  identification.  We  achieved  comparable  system 
performance  to  that  of  the  readers.  ' 

•  We  developed  a  technique  for  phonetic  classification  using  artificial  neural  nets 
(ANN).  Vowel  classification  accuracy  was  achieved,  ranging  from  66  to  100% 
under  varying  experimental  conditions.  m-p  \ 

y  ^  r-  J  £ - - 

C  Plans  for  Next  Year's  Research: 

•  We  will  complete  the  syllable-based  lexical  access  model,  and  evaluate  its 
effectiveness. 

•  We  will  develop  durational  models  for  other  classes  of  speech  sounds.  We  will 
investigate  the  effect  of  speaking  rate  on  these  models.  We  will  use  these  results  to 
develop  a  comprehensive  model  for  segmental  duration  to  aid  speech  recognition. 

•  We  will  refine  our  ANN-based  classification  procedures  for  phonetic  classification, 
and  evaluate  their  performance  against  that  of  more  tradition  techniques. 

•  We  will  investigate  the  role  played  by  prosody  on  phonetic  recognition  and  lexical 
access. 


We  will  investigate  various  alternatives  for  adaptation  to  improve  recognition 
system  performance. 
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Signal  Processing,  pp.  429-432,  pp.  422-425.  (Conference  in  New  York,  N.Y.,  April  11-14, 
1988.) 

Leung,  H.C.,  and  V.W.  Zue,  "Two-dimensional  Characterization  of  the  Speech  Signal  and 
Its  Potential  Applications  to  Speech  Processing,"  1st  International  Conference  on 
Communication  Technology,  in  Nanjing,  China,  Nov.  1987. 


2.  Technical  Reports: 

Lamel,  Lori  F.,  "Formalizing  Knowledge  Used  in  Spectrogram  Reading:  Acoustic  and 
Perceptual  Evidence  from  Stops,"  Technical  Report,  Research  Laboratory  of  Electronics, 
Massachusetts  Institute  of  Technology. 


3.  Presentations: 

Invited: 

Zue,  Victor  W.,  "Phonetically-Based  Approach  to  Automatic  Speech  Recognition,"  21st  IBM 
Computer  Science  Symposium  on  "Mechanization  of  Intelligence  and  Brain  Model," 
Sendai,  Japan,  November  1987. 

Zue,  Victor  W.,  "Discovery  of  Phonetic  Regularities  in  the  Acoustic  Signal,"  CHABA  Annual 
Meeting  in  Washington,  D.C.,  January  1988 


Zue,  Victor  W.,  "Speech  Recognition  Research  at  MIT,"  University  of  Florida,  Gainesville,  FL, 
April  1988. 

Randolph,  Mark.  "An  augmented  Context-Free  Parsing  Algorithm  and  its  use  in  Speech 
Recognition,"  paper  presented  at  1988  IEEE  workshop  on  Speech  Recognition,  New  York, 
May  31-  June  3, 1988. 


Leung,  Hong,  "Vowel  Recognition  Experiments  Using  Artificial  Neural  Nets,"  1988  IEEE 
Workshop  on  Speech  Recognition,  New  York,  May  31 -June  3. 
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Contributed: 

Lamel,  Lori  F.,  "Identification  Of  Stop  Consonants  From  Continuous  Speech  In  Limited 
Context:  Acoustical  Society  of  America,  November  1 987. 

Leung,  Hong.  "Recognition  of  Vowels  Using  Artificial  Neural  Networks,"  Acoustical  Society 
of  America,  May  1988. 
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4.  Books  (and  sections  thereof) 

Zue,  Victor.  "Automatic  Speech  Recognition  and  Understanding",  a  chapter  in  A.I.  in  the 
1 980s  and  Beyond,  p.  185-200,  Cambridge:  MIT  Press,  1987. 
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Papers  Submitted  to  Refereed  Journals  (and  not  yet  published):  0 

Papers  Published  in  Conference  Proceedings:  3 

Books  (and  sections  thereof)  Submitted  for  Publications:  0 
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Invited  Presentations  at  Topical  or  Scientific/Technical  Society  Conferences:  5 
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Number  of  Post  Docs:  0 
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incorporate  knowledge  obtained  system  was  implemented  incorporating 

from  experiments  and  from  acoustic  descriptions  and  reasoning  used 

spectrogram  reading  in  a  knowledge-  by  spectrogram  readers 
based  system  •  system  performance  is  within  1 0%  of 

spectrogram  readers’ 
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